Facilitating speech detection in style!: the effect of visual speaking style on the detection of speech in noise

Authors

  • Nicole Lees
  • Denis Burnham
Abstract

Speakers naturally modify the way they produce speech depending on the listening environment. A hyper-articulatory speaking style is typically employed when listening circumstances are difficult. In contrast, hypo-articulation is adopted when listening conditions are favourable. Previous research suggests that the intentions of the speaker are realized in listeners’ recognition of spoken utterances [1]. Given recent findings that the sight of articulating faces increases the detectability of speech in noise [2, 3, 4, 5], the present study investigates whether speech in noise is more detectable when the listener views hyper-articulated compared to hypo-articulated speech. Normal, Hyper- and Hypo-articulated styles of “ba, bi, bu, da, ga, tha” spoken by three speakers were paired with either corresponding static images (auditory-only condition) or dynamic visual articulations (audio-visual condition) in a two-interval (noise only/speech plus noise) forced-choice detection task at three signal-to-noise ratios (0 dB, 2 dB, and -4 dB). Seeing the articulating face provided a significant advantage for detecting speech in noise compared to auditory-only presentation, regardless of speaking style. Additionally, hyper-articulated speech offered a significant advantage over hypo-articulated speech, suggesting that the amount of facial movement may modulate the AV facilitation effect in the detection of speech in noise.

Similar resources

A New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)

Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, many non-speech signals may be classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...

The effect of modality and speaking style on the discrimination of non-native phonological and phonetic contrasts in noise

Auditory speech is difficult to discern in degraded listening conditions; however, the addition of visual speech can improve perception. The Perceptual Assimilation Model [1] suggests that non-native contrasts involving a native phonological difference (two-category assimilation) should be discriminated more accurately than those involving a phonetic goodness-of-fit difference (category-goodness ...

A survey on speech style of Qazvin’s women based on age and education

Language is a social factor connecting people together. There are different speech styles among individuals; this difference is due to the situation and the context they are in. The purpose of this paper is to study the speech style of Qazvin’s women in different situations. The authors have sought to find answers to the following questions: What is the relationship between speech style and age a...

Speech Enhancement Using Gaussian Mixture Models, Explicit Bayesian Estimation and Wiener Filtering

Gaussian Mixture Models (GMMs) of power spectral densities of speech and noise are used with explicit Bayesian estimations in Wiener filtering of noisy speech. No assumption is made on the nature or stationarity of the noise. No voice activity detection (VAD) or any other means is employed to estimate the input SNR. The GMM mean vectors are used to form sets of over-determined system of equatio...

L2 Learners’ Lexical Inferencing: Perceptual Learning Style Preferences, Strategy Use, Density of Text, and Parts of Speech as Possible Predictors

This study was intended first to categorize L2 learners in terms of their learning style preferences and second to investigate whether their learning preferences are related to lexical inferencing. Moreover, strategies used for lexical inferencing and the text-related issues of text density and parts of speech were studied to determine their moderating effects and the best predictors of lexical infe...

Publication date: 2005